Add output_format to video datasets and readers #6061

NicolasHug · 2022-05-20T16:16:44Z

This PR adds a new output_format parameter to all video datasets, as well as to VideoClips() and read_video(). The default is backward compatible (BC), so it corresponds to:

TCHW for Kinetics
THWC for everything else.

Having two different defaults is far from ideal, but it is the only way to give users a consistent experience in torchvision while keeping existing behavior unchanged. Right now, Kinetics is an outlier. For reference, the decision to have Kinetics follow the TCHW format comes from #3680 (comment).
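To make the two layouts concrete, here is a minimal sketch of how THWC (time, height, width, channels) maps to TCHW. The helper names `axis_order` and `permuted_shape` are hypothetical and exist only for illustration; they are not torchvision's implementation, and the sketch works on shape tuples rather than real video tensors.

```python
# Illustrative only: hypothetical helpers, not part of torchvision.
VALID_OUTPUT_FORMATS = ("THWC", "TCHW")


def axis_order(output_format, source_format="THWC"):
    """Return the axis permutation that turns source_format into output_format."""
    output_format = output_format.upper()
    if output_format not in VALID_OUTPUT_FORMATS:
        raise ValueError(
            f"output_format should be one of {VALID_OUTPUT_FORMATS}, got {output_format}."
        )
    # For each axis letter in the target layout, find its position in the source.
    return tuple(source_format.index(axis) for axis in output_format)


def permuted_shape(shape, output_format, source_format="THWC"):
    """Apply the permutation to a shape tuple."""
    order = axis_order(output_format, source_format)
    return tuple(shape[i] for i in order)


# A 16-frame, 112x112 RGB clip laid out as THWC.
thwc_shape = (16, 112, 112, 3)
print(axis_order("TCHW"))                  # (0, 3, 1, 2)
print(permuted_shape(thwc_shape, "TCHW"))  # (16, 3, 112, 112)
```

On an actual tensor, the same reordering would be `video.permute(0, 3, 1, 2)`. With this PR, passing `output_format="TCHW"` to a dataset, to VideoClips() or to read_video() should make that manual step unnecessary.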

torchvision/datasets/kinetics.py

torchvision/datasets/video_utils.py

datumbox

LGTM, thanks @NicolasHug!

Summary:
Co-authored-by: Vasilis Vryniotis <[email protected]>
Reviewed By: NicolasHug
Differential Revision: D36760918
fbshipit-source-id: bfaa11b43cb0ebffb41b0e24fef1b6b65b6deef4

Add output_format to video datasets and readers

6d2fba2

facebook-github-bot added the cla signed label May 20, 2022

NicolasHug commented May 20, 2022

datumbox reviewed May 20, 2022

NicolasHug mentioned this pull request May 23, 2022

Keeping track of PRs and issues for upcoming 0.13 release #6071

Closed

9 tasks

datumbox approved these changes May 23, 2022

datumbox added the "enhancement" and "module: video" labels May 23, 2022

Merge branch 'main' into tchw_hate

f5be5b5

datumbox merged commit 4c66813 into pytorch:main May 23, 2022
